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■ Abstract 

(N 

A Direct Sum Theorem holds in a model of computation, when solving some k input instances 
together is k times as expensive as solving one. We show that Direct Sum Theorems hold in the 
models of deterministic and randomized decision trees for all relations. We also note that a near 
optimal Direct Sum Theorem holds for quantum decision trees for boolean functions. 

■ 1 Introduction 

u 

One of the goals of complexity theory is to understand the structural properties of different models of 
O . computation. A fundamental question that can be asked in every model of computation is how well 

different computations may be combined. Can we achieve substantial savings when solving the same 
problem on k (independent) inputs together? Or is the straightforward approach, namely running 
the same algorithm k times, optimal? This question is known as the direct sum problem, and has 
been studied in many different settings and variations. 

We say that a Direct Sum Theorem holds for a measure of complexity, when solving k input 
instances together is roughly as costly as k times solving one instance according to that measure. 
Since we are often interested in bounded error computations, we also need to specify how the error 
on k input instances relates to the error on one instance. The direct sum question in a narrower 
sense relates to solving k instances with constant error, while a Strong Direct Product Theorem holds 
when even using roughly k times the resources required to solve one instance with constant error, 
the success probability goes down exponentially in k. This happens when we solve the k instances 
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independently, and a Strong Direct Product Theorem states that this is optimal with respect to 



resources and error. In this paper we only consider the direct sum problem in the narrower sense: 
we compare solving one instance with constant (resp. no) error to solving k instances with constant 
(resp. no) error. 

The decision tree model (see [BW02| ) is perhaps the simplest model of computation, measuring 
the number of input positions that need to be accessed in order to compute a function/solve a 
relation. Still many questions about this model remain open. In this paper we show that the direct 
sum property holds for deterministic and randomized decision trees. 
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Previously, a Strong Direct Product Theorem for decision trees was established by Nisan et 
al. [NRS94| . However, their result does not imply a Direct Sum Theorem in our sense, because it 
is only shown in a weaker setting. Instead of analyzing a single algorithm that has access to all k 
inputs and produces all k outputs, Nisan et al. consider a setting where k algorithms (that can access 
all inputs), each making at most d queries, compute one of the k outputs each, where d is the query 
complexity of computing one instance (with bounded error). Hence this does not establish a Direct 
Sum Theorem in the above sense. 

Previous papers \RS!) I. BN95] have dismissed the direct sum problem for decision trees as 
either very simple, or uninteresting. To quote [NRS94]: "While it is an easy exercise to see that 
direct-sum holds for decision tree depth, the other two problems (direct product and help bits) are 
more difficult." The paper does not make it clear, what kind of decision tree is meant (the setting 
considered there is distributional complexity) . In the distributional setting a general counterexample 
by Shaltiel [ShaOl] makes it clear that some very tight direct sum statements are not even true for 
the model where there is one decision tree that has to solve all k input instances together. 

Ben-Asher and Newman claim in [BN95] : "In the standard decision tree model the question is 
quite uninteresting as queries do not involve variables of more than one of the problem instances at a 
time." This does not seem to be a valid assessment of the problem, because with the same argument 
the strong direct product question for decision trees could be dismissed, which is as of now still an 
open problem (in the setting where one algorithm makes all outputs). 

We give proofs of Direct Sum Theorems for the case of deterministic and randomized decision 
trees. In the deterministic case the main problem is to construct a more efficient decision tree for 
one instance from a given tree for two instances. In the randomized case the proof is along the lines 
of some proofs of Direct Sum Theorems in communication complexity e.g. [JRS05J. 

One may ask if a similar result is true for quantum decision trees, also known as quantum query 
algorithms. While we do not have a proof for this model, a weaker statement can be derived from 
recent results by Reichardt. In [HO!) he shows that the general quantum adversary bound is tight 
within a logarithmic factor for the quantum decision tree complexity of every boolean function (his 
Theorem 1.4). He also shows that the general adversary bound has a direct sum behavior (see 
Theorem 7.2 in the long version of the paper. Note that one has to choose a good "connection" 
function / like XOR because the adversary bound works only for boolean functions). The direct 
sum for the general adversary bound has also been shown previously in Ambainis, Childs, Le Gall 
and Tani [ACGT09]. Hence we can conclude that in the quantum case, at least within a logarithmic 
factor and for boolean functions, a Direct Sum Theorem also holds. 

2 Preliminaries 

A deterministic decision tree on m variables is a rooted binary tree T whose internal vertices are 
labeled by the boolean variables x%, . . . , x m , and whose leaves are labeled by the output values from 
a set y. For every vertex v in T, we denote by vq (respectively v\) the left son (respectively the 
right son) of v, and by T(v) the subtree of T rooted at v. We set = T(vb), for b € {0, 1}, where 
v is the root of T. The depth dy(u) of vertex v in tree T, is defined recursively: it is if v is a 
leaf, otherwise dx{v) = max{dr(?;o), dr{vi)} + 1. The depth d(T) of T is simply the depth of its 
root. Every tree naturally computes a function fx on m variables, whose value at an assignment 
x = (x\, . . . ,x m ) £ {0, l} m is defined recursively as follows: If the root of T is a leaf, then fr(x) is 
the value of its label. Otherwise, if Xi is the label of the root and X{ = b, then fr(x) = fT b (x). 

Clearly, several decision tree compute the same function /. The deterministic decision tree com- 
plexity of / is the depth of the minimal depth decision tree T such that /t = /, and we denote it by 
D(f)- 
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The above definitions naturally extend to trees whose leaves are labeled by elements of y k , for 
some positive integer k. We call these trees k-output deterministic decision trees, they compute 
k- output functions whose range is by definition y k . We will use the notation / = (fW, . . . , /(*)) 
for /c-output functions, where /W is the function computing the ith output of /. In particular, 
we are interested here in the case when m = kn and the functions do not share common input 
variables. More precisely, let / : {0, — ^ y k be a /c-output function whose input variables are 
xi t \, . . . ,xi <n , . . . ,x kj i, . . . ,x k>n . We set Xi = (x^i, . . . ,Xi tH ), and say that / is k-independent if the 
value of depends only on ctlj. 

One can also extend the definition of deterministic decision trees and /c-independence to relations 
/ Q {0, l} m x y instead of functions in a straightforward way (decision trees are required to find an 
output y for each input x £ {0, l} m so that (x,y) £ /). 

In particular, for a relation / C {0, l} m x y k , the relation fW c {0, l} m x y consists of all (x, y), 
such that (x,yi, . . . , yi-i, y, y i+1 , . . . , y k ) 6 / for some yi . . .,yi-i,y i+1 , ...,y k . 

Note that for inputs x for which there is no y with (x, y) E / no requirement on the output is 
made, and hence we can assume that all relations are total without loss of generality. Since for each 
input only one output can be produced, each deterministic tree automatically computes a function 
that is consistent with the relation in question. 

For a relation / C {0, l} n x y, we define the /cth tensor power of / as the relation f® k C 
{0,l} kn xy k byf® k = {((x l ,...,x k ),(y l ,...,y k )) : Vi : (x hyi ) G /}. Note that f® k is fc-independent. 

A randomized decision tree on m variables is a convex combination of deterministic decision trees, 
such that for each input x a correct output is computed with probability 1 — e for a given error 
probability e. If not mentioned otherwise e = 1/3. For /c-output relations / an output (y\, . . . ,y k ) 
is considered erroneous, if (xi,yi) f^' for some i, i.e., all k outputs are required to be correct 
simultaneously. 

R e (f) denotes the e-error randomized query complexity of /, which is the maximum number of 
queries made by the best randomized decision tree with error being at most e on any input. Let \i be 
a distribution on {0, l} n . Let Re(f) represent the e-error distributional query complexity of /, which 
is the maximum number of queries made by the best randomized decision tree with average error at 
most e under \x (note that such a tree can be assumed to be deterministic w.l.o.g., but sometimes it 
is simpler to give a randomized tree). We have the following fact from [Y83J. 

Fact 1 (Yao's Principle) R e (f) = max M i?^(/). 

3 Direct Sum for Deterministic Complexity 

Let / C {0, l} kn x y k be a /c-output relation. Obviously D(f) < £jU D f(i) since the values can 
be evaluated sequentially. We prove that for /c-independent relations this is in fact the least expensive 
way to evaluate /, that is the inverse inequality also holds. 

Theorem 1 (Deterministic Direct Sum) For every k-independent relation f C {0, l} fcn x y k , 
wehaveD(f)>j: k =1 D(fV). 

Proof Let T be a /c-output deterministic decision tree on variables • • • , x k ^ n }. For i = 1, . . . , k, 

we refer to {xn, . . . , a^n} as the iih group of variables. For every vertex v of T, we define recursively 
k single output decision trees Ti(v), . . . , T k (v), where the vertices of Ti(v) are labeled by the variables 
from the ith group. If v is a leaf with label (pi, . . . , b k ), then Ti(v) is a single node tree (a leaf), with 
label 6j. Otherwise, let v be an internal node and let's suppose that its label is from the jth group of 
variables. The root of Tj(v) is by definition v with the same label as in T, its left subtree is Tj(vo) 
and its right subtree is Tj(vi). For all i 7^ j, the tree Ti(v) is defined as the shallower (smaller depth) 
tree between Ti(vq) and Ti(v\). 
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Claim 1 For every vertex v ofT, we have X^=i d{Ti{v)) < cIt(v). 

Proof The proof is by induction on the depth of v, and the statement is obviously true when 
v is a leaf. We suppose without loss of generality that the label of v is from the jth group. Let 
b G {0,1} such that d(Tj(v)) = d(Tj(v b )) + 1. By definition, for all i ^ j, we have d(Ti(v)) = 
mm{d(Ti(vo)),d(Ti(vi))}, and therefore d(Ti(v)) < d{Ti{v b )). Thus 

k 

52d{Ti(v)) < diT^ + l + J^dmvb)) 

i=l i^j 

< d T (v b ) + l 

< dr(v), 

where the second inequality follows from the inductive hypothesis, and the third one from the defi- 
nition of the depth. □ 

We say that T is parsimonious if no variable appears twice on the same root-leaf path. 

Claim 2 Let T be parsimonious. Then for every vertex v in T , for every 1 < i < k, and for every 
assignment xi G {0, l} n for the variables in the ith group, there exists, for all j ^ i, an assignment 
Xj G {0, l} n for the variables in the jth group such that 

fTi(v)(%i) = /t^)^ 1 ' ■ ■ ■ )^i> ■ ■ ■ i^fc)- 

Proof The proof is again by induction on the depth of v. Fix 1 < i < k. If v is a leaf, we can 
choose for every X{ G {0, l} n an arbitrary Xj G {0, l} n , for j ^ i. Otherwise, we distinguish two 
cases, according to the label of v. 

Case 1: The label of v is Xi jP from the ith group of variables, for some 1 < p < n. Let Xi G {0, l} n 
be an assignment for the variables in the ith group, and let Xi P = b. By the inductive hypothesis 
there exists x'p for j ^ i, such that 

We set Xj = x'j, for j ^ i. Then we have 

fTi(v)(Xi) = f Tl {v b ){Xi) 

= ^T(v b ) (^1 ' " ' " ' " " " ' ^k) 
= ^T(v) ("^1 ' • • • ' ^* ' • • • ' ^fc ) ' 

The first equality follows from the definition of ./W^O^i) since = b. The third equality also holds 
because by definition f T ( v )(xi, ■ ■ ■ , Xi, ■ ■ ■ , x k ) = f T (v b ){x\, ■ ■ ■ ,x~i, ■ ■ -,x k ). 

Case 2: The label of v is Xj tP from the jth set of variables for some j ^ i and 1 < p < n. Let b be 
such that Ti(v) = Ti(v b ). Then again by the inductive hypothesis, for every X{ G {0, l} ra , there exists 
x'p for j ^ i, that satisfy 

We define x\ A for I ^ i and q = 1, . . . , n by 

f& if (/,?) = 
I „ otherwise. 
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Then, similarly to Case 1, we have the following series of equalities: 

fT t {v){Xi) = fT t {v b ){Xi) 

= ■^T(i;i,)(^l' — >^*> — i X k) 
= ^Tlv) ip^-i • • • i x it • • • •> x k)- 

The first equality is true because Ti(v) = Ti(vb). The path followed on input (x\, . .. .. . ,x~k) 

in T{v) goes from v to since Xj yP = b, and then it is identical to the path followed on in- 
put (x[, . . . , Xi, . . . , x' k ) in T(vb) because T is parsimonious. Therefore fT(v b )( x v ■ ■ ■ ■ ■ , x' k ) = 
/t(«)(^i ; • • • j %ii ■ ■ ■ j ^k), and the last equality also holds. □ 

We now prove Theorem Q] by contradiction. Let us suppose that D(f) < Y^H=i^(f )■ Let 
be a deterministic decision tree of depth D{f) which computes a function / that is consistent with 
the relation /. Since T is a minimal depth decision tree computing /, we can suppose without loss 
of generality that T is parsimonious. Let r be the root of T, then d{r) = D(f). For i = 1, . . . , k, let 
Tj = Tj(r). By Claim [2] and /c-independence, Tj computes an /W, which is consistent with /W, and 
therefore £>(/ (i) ) < d(Ti). Thus d(r) < ELi d ( T i)> contradicting ClaimdJ □ 

Corollary 1 For every relation f C {0, l} n x and for every integer k, we have D(f® k ) = k- D(f). 

4 Direct Sum for Randomized Query Complexity 

Theorem 2 (Randomized Direct Sum) Let f C {0, 1}™ x y be a relation. Let k be a positive 
integer and let 5 > be a small constant. Then R e (f® k ) > 5 2 ■ k ■ R e /(f), where e' = + 5. 

Proof Let c = R e (f® k ). Let V be & randomized protocol for f® k with c queries and worst case error 
at most e. Let \x be a distribution on {0, l} n . Let [i® k represent the distribution on {0, l} fcn which 
consists of k independent copies of \i. Now let us consider the situation when we provide input to V 
distributed according to In such a situation we can fix the random coins of V in a suitable manner 
to get another protocol V\ such that E( Xi ... a , fc ) + _^®fc [e(a;i . . . x^)\ < e, where e{x\ . . . x^) represents the 
error made by V\, which is now a deterministic protocol, on input {x\ . . . xt) where each X{ £ {0, l} n 
represents the input for the iih instance of /. For notional convenience we use Xi here instead of X{ 
as used in the previous section. Let q(x\ . . . Xk) represent the number of queries made by V\ on input 
(x\ . . . Xk). For each 1 < i < k, let qi{x\ . . . Xk) represent the number of queries made by V\ on X{ on 
input (xi . . . Xk). Since q(x± . . . Xk) = J2i=i Qi{ x i ■ ■ ■ x k), we have, 

c > E {xi _ Xk) ^^k[q(x 1 ...Xk)] 
k 

= E ( X1 ...X k )<r-l J ,®l°Q2<li( X l--- X k)] 

i=l 

k 

= ^2 E (x 1 ...x k )^«4Qi( x l--- x k)} 
i=l 

Therefore there exists 1 < j < k such that E( X1 ^ iXk )^^k[qj{ x i ■ ■ ■ x k)\ < f • Without loss of generality 
let j = 1. Using this and the fact E^ ^.^^^k [e{x\ . . - Xk)] < e, we can argue by standard applica- 
tions of Markov's inequality that there exist x' 2 ■ ■ ■ x' k £ {0, l} fcn_n such that Kx^^q^xix^ . . . x' k )] < 
jr and K Xl ^^[e(xix' 2 . . . x' k )\ < j^. Therefore fixing x' 2 ■ ■ ■ x' k in V\ naturally gives rise to a protocol 
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V2 for / with expected number of queries under \i being at most jr and expected error under \x 
being at most j^t. Now let us consider a protocol V3 which proceeds exactly as V2 but terminates 
whenever the number of queries exceeds pr. Again, using Markov's inequality, it can be argued that 
the expected error of V3 under [x is at most e' = + 5 and of course the maximum queries made 
by V3 is at most Hence by definition R^,(f) < -Sr. Since this is true for every distribution fi on 
{0, l} ra , we get from Yao's Principle the desired result as follows. 

□ 



5 Open Problems 

We proved direct sum theorems for deterministic and randomized query complexity. Note that it 
is also very easy to establish the direct sum property for nondeterministic query complexity (also 
known as certificate complexity). However, several related open problems remain: 

1. The direct sum theorem in the randomized case loses a factor of 5 2 in the lower bound, as well 
as an additive 5 in the error bound. While at least the factor in the lower bound is unavoidable 
in the setting of distributional complexity according to a result by Shaltiel [ShaOT] , this might 
not be necessary in the worst case complexity setting. 

2. In the quantum case no tight result is known, and the result following from Reichardt's work 
holds only for boolean functions. Can a tight result be established, even for all relations? 

3. Establishing general strong direct product theorems is open for both the quantum and the 
randomized/distributional setting. Note that the result of |NRS94j holds only in the weaker 
model where k algorithms compute one output each. 
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